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Abstract 

We show that equivalence of deterministic top-down tree-to-stting transducers is decidable, thus solving a long 
standing open problem in formal language theory. We also present efficient algorithms for subclasses: polynomial 
time for total transducers with unary output alphabet (over a given top-down regular domain language), and co¬ 
randomized polynomial time for linear transducers; these results are obtained using techniques from multi-linear 
algebra. For our main result, we prove that equivalence can be certified by means of inductive invariants using 
polynomial ideals. This allows us to consttuct two semi-algorithms, one searching for a proof of equivalence, 
one for a witness of non-equivalence. Furthermore, we extend our result to deterministic top-down tree-to-string 
transducers which produce output not in a free monoid but in a free group. 

I. Introduction 

Transformations of structured data are at the heart of functional programming lUl, ||2l, ||3l, Q, ||5l and also 
application areas such as compiling 0, document processing Q, ISl, |@, ifTOl . ifTTI . lIT^ . iflTl . automatic 
translation of natural languages |[T4l . IT5l . |[T^ . IITtI or even cryptographic protocols IITSl . The most fundamental 
model of such transformations is given by (finite-state tree) transducers 1191 . Ihll . Transducers traverse the input 
by means of finitely many mutually recursive functions — corresponding to finitely many states. Accordingly, 
these transducers are also known as top-down tree transducers If20l . |[^ or, if additional parameters are used for 
accumulating output values, macro tree transducers |[22ll . Here we only deal with deterministic transducers and 
denote them DT and DMT transducers, respectively (equivalence is undecidable already for very restricted classes 
of non-deterministic transducers |[23l ). 

When the output is produced in a structured way, i.e., in the case of tree-to-tree transducers, many properties, 
at least of transducers without parameters, are fairly well understood. An algorithm for deciding equivalence of 
DT transducers already dates back to the 80s Il24l . More recently, canonical forms have been provided allowing 
for effective minimization If25l as well as Gold-style learning of transformations from examples If26l . In various 
applications, though, the output is not generated in a structured way. This may be the case when general scripting 
languages are employed 1271, non-tree operations are required Il28l or simply, because the result is a string. 

Assume, e.g., that we want to generate a well-formed XML document from an internal tree-like representation 
where the elements of the document do not only have tags and contents but also attributes. The output for the 
input tree 

frame( 

defs(height(20), defs(width(50), end)), 
content(button(”DojioGpress!”), • • •) 

) 

then should look like: 

(frame,^height = ”20”.^width = ”50”) 

(button)DojioGpress! (/button) 

(/frame) 


This translation could be accomplished by a tree-to-string transducer with, among others, the following rules: 


g(frame(xi,X2)) 

qi{end) 

g'i(defs(xi,X2)) 

g2(height(xi)) 

g2(width(xi)) 

( 7 (button(xi)) 


(frame gi(xi)g(x 2 )(/frame) 

) 

q2{xi)qi{x2) 

^height = ’’qsixi)” 

^width = ”g 3 (xi)” 
(button)g 3 (xi)(/button) 


According to the peculiarities of XML, arbitrary many attribute value pairs may be positioned inside the start tag 
of an element. The given rules generate the closing bracket of the start tag from the node end which terminates 
the list of attribute definitions. At the expense of an empty right-hand side, the closing bracket could also be 
generated by the rule for the tag frame directly. In this case, the two first rules should be replaced with: 

g(frame(xi,X 2 )) —)• (frame gi(xi))g(x 2 )(/frame) 
gi(end) —)■ e 


while all remaining rules stay the same. This example indicates that already simple tasks for structured data may 
be solved by transducers processing their inputs in different ways. 

Following the tradition since Il29l . we denote tree-to-string transducers by prefixing fhe leffer y (which sfands 
for “yield”, i.e., fhe funcfion fhaf furns a free info fhe sfring of ifs leaf labels, read from lefl fo righf). In |[9l, |[28l 
dedicated transducers for XML have been introduced. Beyond the usual operations on trees, these transducers 
also support concatenation of outputs. Decidability of equivalence for these transducers has been open. Since 
they can be simulated by tree-to-string transducers, our main result implies that equivalence is decidable for both 
types of transducers (where for Il28l we mean the parameter-less version of their transducers). 

Amazingly little has been known so far for tree-to-string transducers, possibly due to the multitude of ways 
in which they can produce the same output string. As a second example, consider the transducer M with initial 
state q, on input trees with a ternary symbol / and a leaf symbol a, defined by: 


q{f{xi,X2,X3)) 

qi{f{xi,X 2 ,X 3 )) 

91 (e) 

9(e) 


q{x3)aqi{x2)bq{x2) 

qi{x3)qi{x2)qi{x2)ba 

ba 

ab. 


Furthermore, let M' be the transducer with single state q' and the rules: 


q'{f{xi,X 2 ,X 3 )) abq'{X 2 )q'{X 2 )q'{xs) 

q'{e) —>• ab. 


Some thought reveals that the transducers M and M' are equivalent, although the output is generated in a quite 
“un-aligned” way with respect to X 2 ,X 3 . Note that these two transducers do not fall into any class of tree-to- 
string transducers for which equivalence has been known to be decidable so far. One class where equivalence 
is already known to be decidable, are the linear and order-preserving deterministic tree-to-string transducers as 
studied in ll^ . A transducer is linear, if each input variable Xi occurs at most once in every right-hand side. 
A transducer is order-preserving if the variables Xi appear in ascending order (from left to right) in each right- 
hand side. Equivalence for these can be decided by a reduction to Plandowski’s result lf3Tl even in polynomial 
time ll30l . This class of transducers is sufficiently well-behaved so that periodicity and commutation arguments 
over the output strings can be applied to derive canonical normal forms |[32l and enable Gold-style learning |[33l . 
Apart from these stronger normal form results, equivalence itself can indeed be solved for a much larger class of 
tree-to-string translations, namely for those definable in MSO logic 041 . or equivalenfly, by macro free-fo-sfring 
franslafions of linear size increase ll35l . This proof gracefully uses Parikh’s fheorem and fhe fheory of semi-linear 


sets. More precisely, for a Parikh language L (this means L, if the order of symbols is ignored, is equivalent to 
a regular language) it is decidable whether there exists an output string with equal number of a’s and 6’s (for 
given letters a / h). The idea of the proof is to construct L which contains if and only if, on input t, 

transducer Mi outputs a at position n and transducer M 2 outputs b at position m. 

Our main result generalizes the result of llMll by proving that equivalence of unrestricted deterministic top- 
down tree-to-string transducers is decidable. By that, it solves an intriguing problem which has been open for at 
least thirty-five years |[29l. The difficulty of the problem may perhaps become apparent as it encompasses not 
only the equivalence problem for MSO definable transductions, but also the famous hdtOl sequence equivalence 
problem |[36ll . IfTTI . |[38l . the latter is the sub-case when the input is restricted to monadic trees |[39l . Opposed to the 
attempts, e.g., in 1301, we refrain from any arguments based on the combinatorial structure of finite state devices or 
output strings. We also do not follow the line of arguments in |[34l based on semi-linear sets. Instead, we proceed 
in two stages. In the first stage, we consider transducers with unary output alphabets only (Sections |III]-[Vl). In this 
case, a produced output string can be represented by its length, thus turning the transducers effectively into tree- 
to-integer transducers. For a given input tree, the output behavior of the states of such a unary yDT transducer is 
collected into a vector. Interestingly, the output vector for an input tree turns out to be a multi-affine transformation 
of the corresponding output vectors of the input subtrees. As the property we are interested in can be expressed 
as an affine equality to be satisfied by output vectors, we succeed in replacing the sets of reachable output vectors 
of the transducer by their affine closures. This observation allows us to apply exact fixpoint techniques as known 
from abstract interpretation of programs ||4T)1 . to effectively compute these affine closures and thus to decide 
equivalence. In the next step, we generalize these techniques to a larger class of transducers, namely, unary 
non-self-nested transducers. These are transducers which additionally have parameters, but may use these only 
in a restricted way. Although they are more expressive than unary r/DT transducers, the effect of the transducer 
for for each input symbol still is multi-affine and therefore allows a similar (yet more expensive) construction 
as for unary yDT transducers for deciding equivalence. In the final step, we ultimately show that the restriction 
of non-self-nestedness can be lifted. Then however, multi-affinity is no longer available. In order to attack this 
problem, we turn it upside down: instead of maintaining affine spaces generated by sets of input trees, we 
maintain their dual, namely suitable properties satisfied by input trees. Indeed, we show that inductive invariants 
based on conjunctions of polynomial equalities are sufficient for proving equivalence. This result is obtained by 
representing conjunctions of equalities by polynomial ideals iHTl and applying Hilbert’s basis theorem to prove 
that finite conjunctions suffice. We also require compatibility of polynomial ideals with Cartesian products, a 
property which may be of independent interest. For the specific case of monadic input alphabets, we obtain a 
decision procedure which resembles techniques for effectively proving polynomial program invariants by means 
of weakest pre-conditions Il4^ . Il43l . For non-monadic input symbols, we obtain two semi-decision procedures, 
one enumerating all potential proofs of equivalence, while the other searches for counter-examples. 

Having established decidability for all yDMT transducers with unary output alphabet, we indicate in the second 
stage how these transducers are able to simulate yDT transducers with multiple output symbols when these are 
viewed as digits in a suitable number system (Section IVIIb . The corresponding construction maps linear yDT 
transducers into unary non-self-nested yDMT transducers, and general yDT transducers into general unary yDMT 
transducers. In this way, our algorithms for deciding equivalence of unary transducers give rise to algorithms for 
deciding equivalence of linear and general yDT transducers, respectively. While decidability of (in-)equivalence for 
linear yDT transducers has been known (via the result of If34l ). the resulting randomized polynomial complexity 
bounds has only recently been improved to a truly polynomial upper bound by Boiret and Palenta ll44l . No other 
method, on the other hand, allows to decide equivalence of unrestricted yDT transducers. 

Related Work 

Decision procedures for equivalence of deterministic tree transducers have been provided for various sub-classes 
of transducers (see, e.g., ll^ for a recent survey). The equivalence problem for yDT transducers has already 
been mentioned as a difficult open question in |[29l . Still, little progress on the question has been made for 




tree transducers where the outputs are unstructured strings. The strongest result known so far is the decidability 
of equivalence for MSO definahle tree-to-string transductions [341; this class is equal to yDMT transducers of 
linear size increase [35l . For the specific suh-class of linear yDT transducers, we obtain an algorithm with by 
far better complexity bounds as those provided by the construction in [34l . General MSO definable tree-to-string 
transductions on the other hand, can be simulated by t/DT transducers with regular look-ahead (see [451 . [461 ). 
Since equivalence of yDT transducers with regular look-ahead can be reduced to equivalence of yDT transducers 
without look-ahead but relative to a DTTA automaton, our decidability result encompasses the decidability result 
for MSO definable tree-to-string transductions. It is more general, since yDT transducers may have more than 
linear size increase and thus may not be MSO definable. The same holds true for arbitrary yDMT transducers with 
unary output alphabet. It is unclear how or whether the suggested methods can be generalized to the equivalence 
problem of unrestricted yDT transducers. 

The methods employed to obtain our novel results have the following predecessors. The algorithm for deciding 
equivalence in the case of non-self-nested r/DMT transducers is related to the algorithm in [47ll for deciding 
ambiguity equivalence of non-deterministic finite tree automata. Two automata are ambiguity equivalent if they 
agree for each input tree, in the numbers of accepting runs. While vector spaces and multi-linear mappings were 
sufficient in case of finite automata, we required ajfine spaces and multi-affine mappings in case of linear yDT 
transducers. 

The known algorithm for deciding equivalence of yDT transducers with monadic alphabet is based on a 
reduction to the hdtOl sequence equivalence problem. The latter can be solved [3^ via establishing an increasing 
chain of finite sets of word equations which are guaranteed to eventually agree in their sets of solutions. Instead, 
our elegant direct algorithm for monadic unary r/DMT transducers is related to an algorithm effective program 
verification. In [42l . [4311 a decision procedure is presented which allows to check whether a given polynomial 
equality is invariably true at a given program point of a polynomial program, i.e., a program with non-deterministic 
branching and polynomial assignments of numerical variables. Similar to the new algorithm for every state p of 
the DTTA automaton, the verification algorithm characterizes the required conjunction of polynomial equalities 
at each program point by polynomial ideals. These ideals then are characterized as the least solution of a set 
of constraints which closely resembles those in equation (ITOl ). To the best of our knowledge, the algorithm for 
solving the equivalence problem in the unrestricted case, is completely new. 

Organization of the remaining paper 

After providing basic notations and concepts in the next section, we first concentrate on unary transducers. In 
Section |III1 we provide an algorithm for deciding equivalence of yDT transducers with unary output alphabet. 
In Section |IVl we generalize this algorithm to non-self-nested yDMT transducers with unary output alphabet. 
These methods are based on least fixpoint iterations over affine spaces. The case of yDMT transducers with 
unary output alphabet and arbitrary nesting is considered in Section This section makes use of polynomial 
ideals in a non-trivial way. As kind of warm-up, a dedicated algorithm for yDMT transducers with monadic 
input alphabet is provided which is based on least fixpoint iteration over polynomial ideals. The general method 
for arbitrary input alphabets goes beyond that. It is based on the notion of inductive invariants which provide 
proofs of equivalence. The strongest inductive invariant, however, can only be characterized non-effectively via 
a greatest fixpoint over polynomial ideals. Section IVl provides us with a decision procedure of equivalence based 
on two semi-algorithms. In-equivalence can easily be verified by providing a witness inputs for which the outputs 
differ. Searching for a proof of equivalence, on the other hand, may be rather difficult. Therefore, Section |Vl] 
provides a systematic means to identify candidate inductive invariants. Section IVIII indicates how the algorithms 
for unary yDT transducers can be used to decide equivalence for yDT transducers with arbitrary output alphabets 
by providing an appropriate simulation. Section IVIIII then shows that equivalence of yDT transducers remains 
decidable if the output is not considered as a string, i.e., an element of the free monoid, but as an element in 
the free group. This means that we now allow symbols to have positive or negative polarities and assume that 
symbols with opposite polarities may cancel each other out. In order to deal with this situation, we consider a 






setting where the output symbols of a yDT transducer are interpreted as square matrices with rational entries and 
show that in this setting, equivalence is still decidahle. By recalling that the free group with two generators is a 
suh-group of the special linear group of (2 x 2) matrices with entries in Z, we thus arrive at the desired result. 
Finally, Section Hx] discusses applications of the obtained results to various models of transducers as proposed 
in the literature. 

A preliminary version of this paper was presented at FOCS’2015 Il48l . That version has been extended with a 
practical algorithm for enumerating inductive invariants (Section IVl]) and by allowing output not just in the free 
monoid of strings, but instead in the free group (Section IVlllll . 


II. Preliminaries 

For a finite set S, we denote by IS"! its cardinality. For m G N let [m] denote the set {!,... ,m,}. A ranked 
alphabet S is a finite set of symbols each with an associated natural number called rank. By we denote 
that a is of rank m and by the set of symbols in S of rank m. For an m-tuple t and i G [m] we denote 
by tj the fth component of t. The set Ts of trees over S is the smallest set T such that if t G T™ for m > 0 
then also / t G T for all / G For the tree /() we also write /. Thus a tree consists of a symbol of rank 

m together with an m-tuple of trees. We fix the sets of input variables X = {xi, X 2 ,... } and formal parameters 
y = {yi, 2/2, • • • } where for m G N, = {xi, . . ., Xm} and Ym = {yi, • • •, Vm}- 

A {deterministic top-down) tree automaton (DTTA automaton for short) is a tuple A = (P, Il,po,y) where P 
is a finite set of states, S a ranked alphabet, pQ ^ P the initial state, and p the transition function. For every 
/ G and p ^ P, p{p, f) is undefined or is in P™. The transition function allows to define for each p ^ P the 
set dom(p) C 7s by letting / t G dom(p) for / G m > 0, and t G iff p{p, /) = p and tj G dom(pj) 

for i G [m]. The language C{A) of A is given by C{A) = dom(po)- The size |A| of A is defined as |P| + |S| + |/?| 
where \p\ = J2m>o \p~HP'^)\ ' + 1)- 

Let / > 0. A deterministic macro tree-to-string transducer {with I parameters) (yDMT transducer for short) is 
a tuple M = [Q, S, A, 5), where Q is a ranked alphabet of states all of rank / + 1, S is a ranked alphabet of 
input symbols, A is an alphabet of output symbols, go £ Q is the initial state, and 5 is the transition function. 
For every q Q, m > 0, and / G 5{q, f) is either undefined or is in R, where R is fhe smallest set such 

that e G P and if T, Pi,..., T; G P, then also 

(1) aP G P for o G A, 

(2) pjT G P for j G [/], and 

(3) g'(xj,Pi ,... ,Ti)T G P for g' G Q and i G [m]. 

Again, we represent the fact that d{q, f)=T also by the rule: 


q{f{xi,...,Xm),yi,---,yi) -^T. 

A state q ^ Q induces a partial function |g]M from 7s to total functions (A*)^ A* defined recursively as 
follows. Let t = ft with / G m > 0, and t G TfP. Here, we consider a call-by-value (or inside-out) mode 

of evaluation for the arguments of states. Thus for w G (A*)^ |y]M(f)(w) is defined whenever 5(g,/) = P 
for some P and for each occurrence of a subfree g'(xi,Pi ,... ,Ti) in P, |g^]M(ti) is also defined. In this case, 
the output is obtained by evaluating P in call-by-value order with tj taken for Xi and Wj for yj. In function 
applications (especially for higher-order) we often leave out parenthesis; e.g. we write |P]al f w for |P]M(t)(w). 
We obtain, 

l7lM(/t) w = |P]Mtw 


where the evaluation function |P]m is defined as follows: 


|e]Mt w 
|aP']Mt w 
IVjT'jMtw 

|g'(xi,Pi,... ,P/)P']Mtw 


a |P']Mt w 
Wj [P'lAft w 

WIm ti (|Pi] t W, . . . , |P;] t w) |P']m t W. 




The transducer M realizes the (partial) translation M : Te ^ which, for t € 7s, is defined as M{t) = 
■■■,() if [7olM(f) is defined and is undefined otherwise; the domain of this translation is denoted 
dom(M). The yDMT transducer M is a deterministic top-down tree-to-string transducer (yDT transducer for 
short) if ( = 0. The yDMT transducer M is total if 5{q, /) is defined for all g G Q and f and M is unary 
if |A| = 1. In the latter case, the output can also he represented hy a number, namely, the length of the output. 
Finally, a yDMT transducer is self-nested, if there is a right-hand side T in (5 so that T contains an occurrence 
of a tree q'{xi,Ti,... ,Ti) where one of the trees Tj contains another tree q"{xi,T[,... ,T[) for the same Xi. A 
yDMT transducer is called non-self-nested, if it is not self-nested. 

As for DTTA automata, we define the size \M\ of a yDT or yDMT transducer M as the sum of the sizes of the 
involved alphabets, here Q,T, and A, together with the size of the corresponding transition function where the 
size of a transition S{q, f) = T is one plus the sum of numbers of occurrences of output symbols, parameters, 
and states in T. 

In the following three sections, we consider transducers with unary output alphabet A = {d} only. In this case, 
we prefer to let the transducer produce the lengths of the output directly. Then, the right-hand sides T may no 
longer contain symbols d, but constant numbers c (representing d^). Likewise, concatenation is replaced with 
addition. For convenience, we also allow multiplication with constants to compactly represent repeated addition 
of the same subterm. 

Example 1: Consider the yDMT transducer with set Q = {(70) q} of states and initial state go and the following 
transition rules: 

qo{f{xi,X 2 ),yi) q{xi,q{x 2 ,d)) 

q{a{xi),yi) yiq{xi,yi) 

q{e,yi) e 

where the output is considered as a string. This yDMT transducer is non-self-nested and realizes a translation r 
which maps each input tree /(a”(e), a"^(e)) to the string in d” "^. As the output alphabet is unary, we prefer to 
represent the output length by these rules: 

qo{f{xi,X 2 ),yi) q{xi,q{x 2 ,l)) 

q{a{xi),yi) yi + q{xi,yi) 

q{e,yi) 0. 

A DTTA automaton accepting the domain of the given yDMT transducer may use states from {po > p} with initial 
state pq where 

p{PoJ) = {p,p) p{p,a)=P p{p,e) = 0- 

■ 

Thus, right-hand sides T now are constructed according to the grammar: 

T ::= c\yj\q{xi,Ti,...,Ti)\Ti+T 2 \c-T' 

where the non-negative numbers c may be taken from some fixed range {0,1,... , h}. The size |r| then is defined 
as the size of T as an expression, i.e., 

|c| = Idjl = f \T1-\-T2\ = iTil-|-IT2I 

\q{xi,T^,...,Ti)\ = 2+|ri| + ... + |rn |c-r'| = 2 + \T'\. 

Thus, e.g., for T = 2 -|- 3 • q{xi, 1,0), |r| = 4 -f \q{xi, 1,0)| = 4 -|- 4 = 8. 



From Arbitrary to Binary Input Alphabets 

Here we state a technical lemma that allows to restrict the rank of output symbols of our transducers to 
two. Let S he a ranked alphabet and ± a special symbol not in S. By bin(Il) we denote the ranked alphabet 
U I a € S}. For sequences s of trees over S we define their binary encoding bin(s) as: bin(s) = _L 
if s is the empty sequence, and bin(s) = (T(bin(fit 2 • • • fm), bin(s')) if s = a{ti,... with a G 

m > 0, fi,..., fm € 7s, and s' € T-^. Likewise for S C Ts, bin(S') = {bin(s) | s G 5}. Note that this encoding 
corresponds to the first-child-next-sibling encoding of unranked trees, here applied to ranked trees. 

As an example, consider the tree t = f{b,g{c),h{b,c)). Then the encoding bin(t) is given by: 

bin(t) = /(5(_L,5 (c(_L,X),/i(5(_L,c(_L,_L)),X))),±) 

Lemma 1: Let M = (Q, S, A, qo, 6) be a yDMT transducer and let m be the maximal rank of symbols in S. 
Then a yDMT transducer M' = {Q', bin(S), A, ^q, 6') together with a DTTA automaton B can be constructed in 
time polynomial in |M| such that 

(1) bin(dom(M)) = £(i?) n dom(M'), 

(2) M'(bin(t)) = M{t)) for all t G dom(M), 

(3) \Q'\ = m\Q\, 

(4) M' is a total yDT if M is. 

Proof: The DTTA automaton B is meant to check whether a tree in bin(S) is an encoding of a tree in bin(7E), 
i.e., C{B) = bin(7s). Such an automaton can be constructed as B = {P, bin(E), l,p) where P = {0} U [m] and 
p is given by: 

p{j + lj) = {k,j) if/GSW 
p(0,^) = 0 

For this DTTA automaton B, C{B) = bin(7s) holds. 

The yDMT transducer M' is defined as follows. Q' = {(g, i) | q G Q, m > 1, f G [m]} and q'^ = {q^, 1). Let q G 
Q and / G S of arity /c > 0. If 5(q, /) is defined and equals T, then we let 5'{{q^ 1); /) = T' where T' is obtained 
from T by replacing every occurrence of q'{xi, Ti,..., T;) with {q', f)(xi, T{,..., T/) where each T/ is obtained 
from Tj in the same way. Furthermore for every 2 < i < k, we define 6'{{q,i),f) = {q,i — l)(x 2 ,yi, ■ ■ ■ ,yi) 
and finally for all q' G Q', 6'{q',-L) = e. 

By construction, M' is total whenever M is. Also, the bounds to the number of states is obvious. For the 
correctness of the construction, we observe that whenever for i > 1, the state (q, i) is called with the encoding 
of a list ti .. .tk with k > i, then the same output is produced as by M when applied to the input tree L, i.e., 

|(g,i)lM'(bin(ti. ..tk)){yi,.. .,yi) = lqlM{ti){yi, 

The proof is by induction on the structure of the sequence ti ... tk, where for subterms T of right-hand sides of 
M and corresponding subterms T' of right-hand sides of M', 

|r']M'(bin(fi.. = |r]M(fi, ■ ■ ■ ,4) 

holds for all s'. From this, the first two assertions follow. ■ 

Example 2: Consider the following rule: 

q{f{xi,X2,X3),yi) q{xi,q{x2,q{x3,yi))) 

for the ternary symbol / G S. By the construction provided for Lemma [H this rule is simulated by means of the 
rules: 

qi{f{xi,X 2 ),yi) qi{xi,q2ixi,q3{xi,yi))) 

q 2 {g{xi,x 2 ),yi) qi{x 2 ,yi) (s'G S) 

q3{g{xi,X2),yi) q2{x2,yi) (s'G S) 


where the state qi corresponds to the state q and the states 92,93 traverse the encoding of a list tit 2 t 3 where q 
is called for t 2 and t^, respectively. ■ 

We remark that non-self-nestedness may not he preserved hy our construction. 


III. Unary Transducers without Parameters 

We first consider a single unary total yDT transducer and show that equivalence of two states relative to a 
DTTA automaton can he decided in polynomial time. This result then is extended to decide equivalence of two 
not necessarily total unary yDT transducers. Let M = {Q,T,,{d},qo,5) he a total unary yDT transducer, and 
A = {P, T>,po, p) a DTTA automaton. Assume that Q = [n] for some natural number n. Our goal is to decide for 
given q,q' € Q whether or not | 9 |M(f) = for all t G C{A). For every t ^Tt, and q € Q, | 9 ]M(f) = dd' 

with r G N, i.e., | 9 ]at can he seen as a tree-to-integer translation mapping t to r; we denote r hy |f]q and write 
|f] for the vector (|f]i,..., |f|n) G N”, or, more generally, in Q”. For a vector v G Q"" we again denote its ith 
component hy Vj. Then for q e Q, m > 0, f G and ti,... G 7 e, the output |/(ti,... ,tm)jq G Q 

can he computed arithmetically hy 

|/(tl, . . . ,tm)lg = [(5(9,/)lM([tll, . . . , [tm]) (1) 

where for T G (A U Q{Xm))* and a vector x = (xi,... ,Xm) of vectors x* G the number {TJm x is given 
by: 

[c]m X = c 

[j'(a^*)lMX = Xij 

K + T'Imx = ITi'Im X + IT'Im X 

[c • T'Jm X = c • [T'Jm X. 

By structural induction on T, we conclude that 

m n 

ITJm x = + ■ Xij 

i=l j=l 

for suitable numbers 60 , bij G Q. Thus, {TJm and hence also |( 5 ( 9 , /)]m constitutes a multi-affine mapping from 
(Q”)’” to Q. Technically, a multi-affine mapping H is affine in each argument. This means that the transformation 
H' corresponding to the fcth argument and fixed xi,... ,Xk-i,Xk+i, ■ ■ ■ ,Xm, which is defined by: 

H'{x') = iT(xi,...,Xfc_i,x,Xfc+i,...,x„) 

is affine, i.e., 

n n 

H'iyo + K{yr - yo)) = H'iyo) + Xr{H'{yr) - iT'(yo)) 

r=l r=l 

holds for vectors yo,..., yn G and Ai,..., G Q. Accordingly, we define the (output) semantics of / G S 
of arity m as the function |/] : (Q”)’” —> by: 

[/I X = ([5(1, /)]m X,..., |5(n, /)|m x) (2) 

which again is multi-affine. 

Theorem 2: Let S be a fixed ranked alphabet, and A a DTTA automaton over S. Let M a total unary yDT 
transducer with input alphabet S, and q, q' states of M. It is decidable in polynomial time whether or not 
lolMit) = Wjuit) for all t G £(A). 

Proof: By repeated application of the transformations |/],/ G S, every tree t ^ Te gives rise to a vector 
|f] G For a set S' C 7 e, let [S'] = {[f| | f G S}. Then two states q,q' are equivalent relative to 5 C 7s 
iff Hqqi(jv) = 0 for all v = (vi,..., v„) G [S'], where the function Hqqi is given by Hqqf'v) = Vg — Vg/. The 



forall [p G P) Bp := 0; 

repeat 

done := true; 

forall {p,pi,...,pmeP,f eY. with p{p, f) = {p^,... ,p^)) 
forall ((vi,..., Vm) e Bp^ X ...X Bp^) 
v:=[/l(vi,...,Vm); 
if V 0 aff(i?p) 

Bp\=Bp[j{w}; 

done := false; 
until (done = true); 

Figure 1. Computing bases for the closures aff({|f] | t G dom(p)}),p G P. 


set of vectors in |£(^)] can be characterized by means of a constraint system. Consider the collection of sets 
Vp,p ^ P, which are the least sets with 

Vp^m{Vp,,...,VpJ (3) 

whenever p{p, /) = (pi,... ,Pm) holds. Then {[t] | t G Y{A)} is precisely given by the set Vp^. 

For a set C C (Q” of n-dimensional vectors, let aff(l/) denote the affine closure of V. This set is obtained 
from V by adding all affine combinations of elements in V: 

r 

afF(l/) = {so + ^ Aj • {sj - So) | r > 0, so,... , G C, Ai, ..., G Q}. 
i=i 

Every set V C has a subset B C V of cardinality at most n + 1 such that the affine closures of B and V 

coincide. A set B with this property of minimal cardinality is also called affine basis of aff(l/). For an affine 
function H : Q such as Hqqi and every subset V C Q^, the following three statements are equivalent: 

1) B(v) = 0 for all V G E; 

2) H(v) = 0 for all v G afF(E); 

3) ff(v) = 0 for all V G i? if S is any subset of V with afF(i?) = afF(l/). 

Instead of verifying that B(v) = 0 holds for all elements v of Vp^, it suffices fo fesf H(v) = 0 for all elemenfs 
V of an affine basis B C Vp^. Accordingly, we are done if we succeed in computing an affine basis of the set 
aff(l^o)- If is unclear, though, how the least solution Vp,p G P, of the constraint system ([3]) can be computed. 
Instead of computing this least solution, we propose to consider the least solution of the constraint system (O, 
not over arbitrary subsets but over affine subsets of only. Like the set of all subsets of Q (ordered by 

subset inclusion), the set Al(Q”) of all affine subsets of Q (still ordered by subset inclusion) forms a complete 
lattice, but where the least upper bound operation is not given by set union. Instead, for a family B of affine 
sefs, fhe least affine sef containing all B G 0 is given by: 

Ue = afF(US). 

We remark that affine mappings commufe with least upper bounds, i.e., for every affine mapping F : > Q", 

F(IJ^) = F(aff(Ui3)) = afF(F(US)) 

= aff({F(v)) I V G U®}) 

= I B e B}. 

Lef Vp,p G P, and Vp,p G P, denofe fhe least solutions of ([3l) w.r.t. the complete lattices P(Q”) and Al(Q”), 
respectively. Since for each / G S, |/] is affine in each of its arguments, it follows by the transfer lemma of 
ll^ (see also Il50l ). that 


^^f{Vp) = V^ (pGP). 


The complete lattice on the other hand, satisfies the ascending chain condition. This means that every 

increasing sequence of affine subsets is ultimately stable. Therefore, the least solution Vp ,p ^ P, of the constraint 
system ([3]) over ^(Q”) can be effectively computed by fixpoint iteration. One such fixpoint iteration algorithm 
is presented in Figure [T] Each occurring affine subsef of Q” is represenfed by a basis. For fhe resulting basis 
Bp^ of the affine subset Vp^^ = afi{Vpg) we finally may check whether or not Hqq'{w) = 0 for all v G Bp^, which 
completes the procedure. 

The algorithm of Figure [T] starts with empty sets Bp for all p ^ P. Then it repeatedly performs one round 
through all transitions p{p,f ) = (pi, • ■ ■ ,Pm) of A while the flag done is false. For each transition p{p,f) = 
(pi,.. .pm), the transformation |/] is applied to every m-tuple v = (vi,..., v^) with Vj G Bp.. The resulting 
vectors then are added to Bp — whenever they are not yet contained in the affine closure afi{Bp) of Bp. The 
iferafion ferminafes when during a full round of the repeat-until loop, no further element has been added to any 
of the Bp. 

In the following, we assume a uniform cost measure where arithmetic operations are counted as 1. Thus, 
evaluating a right-hand side 5{q,f) takes time at most proportional to the number of symbols occurring in 
6{q,f). Concerning the complexity of the algorithm, we note: 

• The algorithm performs at most /i • (n + 1) rounds on the repeat-until loop {h and n are the number of states 
of A and M, respectively); 

• In each round of the repeat-until loop, for each transition of A, at most (n + I)'” tuples are considered (m 
is the maximal arity of an input symbol); 

• for each encountered vector, time O(n^) is sufficient to check whether the vector is contained in the affine 
closure of the current set Bp (see, e.g., chapter 28.1 of lISTl ). 

Accordingly, a full round of the repeat-until loop can be executed in time 0(|A| • \M\ •n’") — giving us an upper 
complexity bound 0(|A| • \M\ ■ hnP'~^^) for the algorithm where m can be chosen as 2, according to Femma[T] 

■ 

The base algorithm as presented in the proof of Theorem |2l can be further improved as follows: 

• We replace the Round-robin iteration by a worklist iteration which re-schedules the evaluation of a transition 
of A for a state p' € P and an input symbol / only if Bp. for one of the states pi in p{p', f) has been updated. 

• We keep track of the set of tuples which have already been processed for a given pair {p', /), / an input 
symbol and p' state of A. This implies that throughout the whole fixpoint iteration, for each such pair {p', /), 
inclusion in the affine closure must only be checked for (re -|- 1)™ tuples. 

• For a non-empty affine basis B, we can maintain a single element v' £ B, together with a basis of the linear 
space Lb corresponding to B, spanned by the vectors {v — v'),v G B\{v'}. By maintaining a basis of in 
Echelon form, membership in afl{B) can be tested in time 0{v?). 

Applying these three optimizations, the overall complexity comes down to 0(|A| • \M\ ■ 

So far, we have compared the output behavior of two states of a unary total yDT transducer M relative to 
some DTTA automaton only. Our decision procedure for equivalence, however, readily extends to arbitrary unary 
yDT transducers. Note that the exponential upper bound of Theorem [3] is sharp, since non-emptiness for unary 
yDT transducers is already FxPTlME-complete (see Theorem 9 of |[39ll ). 

Theorem 3: Equivalence for (possibly partial) unary yDT transducers can be decided in deterministic expo¬ 
nential time. If the transducers are linear, then the algorithm runs in polynomial time. 

Proof: First, we, w.l.o.g., may assume that we are given two states q^, q'^ of a single yDT transducer, and the 
task is to decide whether the partial mappings [qolM and I^oIm coincide, i.e., whether (1) [qolM(f) is defined 
iff WolM{t) is defined, and (2) |qolM(f) = WolMit) whenever bofh are defined. In order fo decide the former 
task, we construct DTTA automata A, A' where the languages of A and A' are precisely given by the domains 
of the translations I^oIm and [qolM> respectively. 


The set of states and transitions of A can be determined as the smallest subset F of sets Q' C [n] together 
with the partial function p as follows. First, {qo} € P which also serves as the initial state of A. Then for every 
element Q' £ P and every input symbol / € E of some arity m, where 6{q, f) is defined for each q £ Q', every 
set Q' is contained in P for i = 1,..., m, where Q'- is the set of all states q' £ [n] such that q'{xi) occurs in the 
right-hand side 6{q, f) for some q £ Q'. In this case then p has the transition p{Q', f) = Qi--- Q'm- The DTTA 
automaton A' is obtained by starting with the initial state {gg} instead of {g'o}^ and subsequently proceeding 
analogously to the construction of A. Assume that the number of states of A and A' are q and q', respectively. 
Then property (1) is satisfied iff C{A) = C{A'). This can be verified in time polynomial in the sizes of A and 
A'. Now assume that C{A) = C{A'). Then we construct a total yDT transducer M' from M by adding to the 
transition function of M a transition q{f{xi,... ,Xm)) —^ e for every state q and input symbol / — where M 
does not yet provide a transition. By construction, = [9lM(f) whenever is defined. Therefore 

for every t £ C{A), {qojM'it) = {q'olM'it) iff {qalnit) = WolMit). Using the algorithm of Theorem d the 
latter can be decided in time polynomial in the sizes of A and M'. 

The size of the DTTA automaton A characterizing the domain of the yDT transducer M is at most exponential 
in the size of M. In case, however, that M is linear, the size of the corresponding automaton A is at most linear 
in the size of M. From that, the complexity bounds of the theorem follow. ■ 

Theorems d and dean be applied to decide Abelian equivalence of r/DT transducers with arbitrary output alphabet. 
Abelian equivalence of two deterministic tree-to-string transducers means that the outputs for every input tree 
coincide up to the ordering of output symbols. 

IV. Non-Self-Nested Unary Transducers with Parameters 

We consider unary non-self-nested yDMT transducers and show that their equivalence problem can be solved 
in co-randomized polynomial time. This implies equivalence with the same complexity for (arbitrary) linear yDT 
transducers. 

Deterministic macro tree-to-string transducers (yDMT transducers) combine yDT transducers with the nesting 
present in macro grammars. Each state of a yDMT transducer takes a fixed number of parameters (of type output 
tree). Recall that a yDMT transducer is non-self-nested if whenever q'{xj ,...) occurs nested in ...) implies 
that i j. Note that non-self-nested yDMT transducers are strictly more powerful than yDT transducers as shown 
in the following lemma. 

Lemma 4: The translation of Example [H which is realized by a non-self-nested unary yDMT transducer, cannot 
be realized by any yDT transducer. 

Proof: Eor convenience, we prove a slightly stronger result, namely, that this translation also cannot be 
realized by any yDT transducer even if it is equipped with regular look-ahead (a yDT*^ transducer). Assume for a 
contradiction, that a given yDT*^ transducer N realizes the translation of M where N has a finite set Q of states 
and uses the finite bottom-up automaton B for providing look-ahead information about the input. Eet ni 7 ^ n 2 
so that a’^i(e) and a’^=(e) correspond to the same look-ahead state of B. Then for z = 1,2 and j = 1,2, 

N{f{a^fe),a^^{e))) = c-f XlqeQ (e)) -h EqeQ (e)) 

for suitable numbers c, Cg, Cg (independent of i,j). Eor j = 1, 2, consider the difference in the outputs: 

A,- = Ar(/(a-He),a-^(e)))-A(/(a-=(e),a-^(e))) 

= T.q&QCq{MN{a^^{e)) - Miv(a’""(e))) 

and observe that it is independent of j. According to our assumption, N realizes the translation of M. Therefore, 

0 = Ai — A 2 

= (ni - 77-2) • ni - (ni - 77-2) • 772 
= {ni - 772) • (771 - 772) 

/ 0 


— a contradiction. Hence the translation of M cannot be realized by any yDT^ transducer. ■ 

As in the case for unary ?/DT transducers, we first consider total unary yDMT transducers only, but relative to 
a DTTA automaton A. Assume that a unary yDMT transducer M is given by M = ([n], S, {d}, io,6). Recall that 
we assume that all states have exactly I + 1 parameters where the first one is the input tree and the remaining I 
parameters accumulate output strings, i.e., numbers. The output for a state q and an m-ary input symbol / G S, 
then is given by: 


[/(ti,...,tm)|qy = |r]M([ti] y, y) (4) 

when q{f{xi, ..., Xra),yi, ■ ■ ■ iVi) ^ T is a rule of M, and y is a vector of parameters in Here, |T'] x y 
for a vector x = (xi,..., x^) of vectors Xj G > (Q)*^ is defined by: 


Mm X y 
IVklM X y 
|c • T'Im X y 
IT' + T'Im X y 
lj{xi,T[,... ,T[)j xy 


c • |r'] X y 

MUm X y + [T^Im X y 
Xii(lT{]M X y,..., |r/]M X y). 


By structural induction, we verify that for all input trees f G 7s and all states q of the r/DMT transducer M, |f]q 
is an affine function —)• Q, i.e., [fjg y = Vgo + v^iyi + ... + v^/y^ for suitable Vgj G Q. Accordingly, |f] can 
be represented as the two-dimensional matrix v = (vgj) G 

Now assume that the arguments x*, i = 1,..., m, are all vectors of affine functions Q, and let x denote 

the triply indexed set (x^jk) of coefficients in Q (r = 1,..., m, j = 1,..., n and k = 0,... ,1) representing these 
functions. Then 


I/lgxy = + + (5) 

where is a polynomial over the variables x. Thus, |/] can be represented by the matrix = (rj^) G 

Q[:XLjnx{l+l) _ 

Example 3: Consider the yDMT transducer M from Example [T] which we extend to a total r/DMT transducer 
by adding the rules 

qo{a{e),yi) 0 qo{e, yi) 0 q{f{xi,X 2 ),yi) 0 


Then we obtain: 


I/lgo(xi,X2)(yi) 

Hg(xi)(yi) 

Iel.()(yi) 


XlgO + Xlql • (X2g0 + X2ql ' 1) 
XlgO + Xlql • X2g0 + Xlgl • X2gl 

yi + Xiqo + Xi,i • yi 

XlqO + (1 + Xlql) • yi 

0 . 


In this section, we first examine the case that the r/DMT transducer M is non-self-nested (such as the yDMT 
transducer from Example [T]). Then each polynomial in ([5]) is a sum of products: 

^ * • • • * ^isjsks 

where the ii,... ,is are pairwise distinct, i.e., each argument Xj contributes at most one factor to each product. 
We conclude that the transformation |/] is multi-affine. This means that the mapping |/] when applied to an 
mtuple of values in (i.e., vectors of affine functions) is an affine function of each of the x* and y, when 

the other arguments are kept constant. Thus, |/] commutes with affine combinations in any of the arguments Xj 
and, for each sequence xi,... ,Xm of matrices in again results in an affine function of y. 


As in the case of yDT transducers, we can construct a constraint system analogously to the system of 
constraints Q whose unknowns are indexed with the states from the automaton A — only that now each 
unknown Vp receives a set of values in (vectors of affine transformations) instead of values in (plain 

vectors). This constraint system has a least solution where the value for Vp is the set of all affine transformations 
[f|,f E dom(p). 

The two states equivalent with empty parameters relative to A iff Tf(|t]) = 0 for all t E C{A) 

where H{v) = Vg^o — Vg^o for v = (vg^) E (recall that for the affine function Vg = (vgo,..., Vg;), 

Vg(0,... , 0) = Vgo). As in the last section, the function H for testing equivalence of states, is affine. 

For a set y C of matrices, let aff(y) denote the affine closure of V. This closure is defined 

analogously as for vectors. Only note that now an affine basis of the affine closure of V may have up to re-((+l) + l 
elements (compared to n + 1 in the last section). Now let H denote any affine funcfion H : ^ (Q. 

Analogously fo fhe lasf section, for every set V C H{v) = 0 holds for all u E F iff H{v) =0 holds 

for all u in a basis of aff(y). We conclude that it suffices fo defermine for each sfafe p' of A, an affine basis 
Bpi of the set Vpi and then verify that Tf(v) = 0 for all v E Bp^ if po is the initial state of A. With a similar 
algorithm as in the last section this is possible using a polynomial number of arithmetic operations only — given 
that the maximal arity of input symbols is bounded. Therefore, we obtain: 

Theorem 5: Assume that M is a non-self-nested total unary yDMT transducer and A is a DTTA automaton. 
Then for every pair q, q' of states of M, it is decidable whether q and q' are equivalent relative to A. If the 
arity of input symbols is bounded by a constant, the algorithm requires only a polynomial number of arithmetic 
operations. 

In case of non-self-nested r/DMT transducers and multi-affine functions, the lengths of occurring numbers, however, 
can no longer be ignored. In order to calculate an upper bound to the occurring numbers, we first note that for 
each state p' of A, the basis of aff(l^/) as calculated by our algorithm, is of the form |f] for a tree in C{A) of 
depth at most {{I + 1) ■ n + 1) ■ h A n,l and h are the numbers of states and parameters of M, and the number 
of states of A, respectively. Concerning the lengths of occurring numbers, we prove: 

Lemma 6: Assume that M is a non-self-nested unary yDMT transducer M where the ranks of input symbols 
are bounded by m, and the constants occurring in right-hand sides of rules are bounded by h. Then 

if N is the depth of t and b is the maximum of the argument numbers yi,..., y;. 

Proof: The proof is by induction on the depth of t. Thus, assume that t = f t with t = (ti,..., t^), m >0, 
and assume that the induction hypothesis holds for the tj. Let q(/(xi,..., Xm),yi, ■ ■ ■ iVi) ^ T he a. rule of M. 
Then for y = (y^,... ,y;), 

MmWy = {TjMty 

For T let a{T) denote the nesting depth of calls q{xi,...). Note that a{T) < m since M is assumed to be 
non-self-nested. Since \T\ < \M\, and the depth of each tj is less than the depth of t, the assertion follows from 
the following claim: 

(iTlMty) < (/i+l)mW)+l)-(|M|.(r„+l))«- 

if N is the maximal depth of a tree tj. The proof of this claim is again by induction, but now on the structure 
of right-hand side T. 

If T is a constant or equals yj for some j, the claim obviously holds. In case T equals a sum Ti -|- T 2 or a 
scalar product c ■ T', the claim also follows easily from the inductive hypothesis. It remains to consider the case 
where T = q'{xi,Ti,... ,Ti). By inductive hypothesis for the Tj, we find that for every i, 

miM ty<{h + l)IT|■a(^).(|M|.(„^+l))-- . ^ 


since the nesting depth of each Ti is at most a{T) — 1. Therefore, 

= W]m (t*) ([TiJm t y,..., [TjJm t y) 

< (/i + l)(|MKm+l))^-i . (/j ;^)|r|.a(M).(|MKm+l))"-i . ^ 

< (/i + l)(|MKm+l))~-i + |r|.a(M).(|MKm+l))«-i . ^ 

< (/i + l)|T|.(a(M) + l).(|M|.(m+l))'^-i . 5 

since |Tj| < |T| and the nesting-depths of any of the \Ti\ is hounded hy a{T) — 1. This completes the proof. ■ 

Accordingly, the bit length Z{N) of numbers occurring in |f] for trees of depth N is hounded hy 0((|M| • 
(m + 1))^) where m is the maximal rank of an input symbol. This means that, for m > 1, the bit lengths of 
occurring numbers can only be bounded by an exponential in the sizes of M and A. Still, in-equivalence can be 
decided in randomized polynomial time: 

Theorem 7: In-equivalence of states of a non-self-nested total unary yDMT transducer relative to a DTTA 
automaton, is decidable in randomized polynomial time, i.e., there is a polynomial probabilistic algorithm which 
in case of equivalence, always returns false, while in case of non-equivalence returns true with probability at 
least 0.5. 

Proof: Assume that M is a non-self-nested total unary r/DMT transducer and A a DTTA automaton. Our goal 
is to decide whether or not [qolM(f) = l9olAf(0 foi" ^ ^ Now let k denote any prime number. Then the 

set of integers modulo k, Z^, again forms a field. This means that we can realize the algorithm for determining 
affine closures of fhe sefs Vp as well as the check whether an affine mapping H refums 0 for all elemenfs of an 
affine basis now over Z^. The resulting algorithm allows us to decide whether the outputs for coincide for 
all inputs from C{A) modulo the prime number k by using polynomially many operations on numbers of length 
0(log(/c)) only. In particular, if non-equivalence is found, then qq, Qq cannot be equivalent relative to A over Q 
either. 

Let 2^ be an upper bound to Z{n ■ {I + 1) + 1) ■ h) {n,l the number of states of M and the number of 
parameters of states of M, respectively, and h the number of states of A) where D is polynomial in the sizes of 
M and A. Then we have: 

Lemma 8: qq, Qq are equivalent relative to A iff qo, are equivalent relative to A modulo 2^ distinct primes. 

Proof: Assume that the latter holds. Then the product already of the smallest 2^ primes vastly exceeds 2^ . 
Therefore by the Chinese remainder theorem, Tf([f]) = 0 holds also over Q for all t G C.{A) of depth at most 
(n • (f -|- 1) -|- 1) • h. Therefore, qq, q^ must be equivalent. ■ 

Clearly, if qo and q^ are equivalent relative to C{A), then they are also equivalent relative to C{A) modulo every 
prime number k. Therefore now assume that qo and qQ are not equivalent relative to C{A). Let K denote the set 
of all primes k so that qo and q'o are still equivalent relative to C{A) modulo k. By Lemma HI this set has less 
than 2^ elements. Now consider the interval [0,1? • e^]. Note that each number in this range has polynomial 
length only. When D is suitably large, this interval contains at least >4-2^ prime numbers (see, e.g., 1521 ). 
Therefore, with probability at least 0.75, a prime randomly drawn from this range is not contained in K and 
therefore witnesses that qo and q'^ are not equivalent relative to C{A). Since a random prime can be drawn in 
polynomial time with probability 0.75, and 0.75 • 0.75 > 0.5, the assertion of the theorem follows. ■ 

V. General Unary Transducers with Parameters 

In the following, we drop the restriction that the yDMT transducer M is necessarily non-self-nested. Then the 

{ f) 

polynomials are no longer necessarily multi-linear in the variables xi,...,Xm- Accordingly, techniques 
based on affine closures of the sets |dom(p)] are no longer appropriate. 

Instead, we propose to generally reason about properties satisfied by the elements of the sets |dom(p)]. Let 
z = {zqfc I q = 1,... ,n,k = 0,...,Z} denote a fresh set of variables. The key concept which we introduce 
here is the notion of an inductive invariant of M relative to the DTTA automaton A. As candidate invariants we 
only require conjunctions of equalities r = 0 where r G Q[z] is a polynomial over the variables z with rational 


coefficients. Instead of referring to such a conjunction directly, it is mathematically more convenient to consider 
the ideal generated from the polynomials in the conjunction. Formally, an ideal of a ring R is a subset J C R 
such that for all a, a' ^ J, a + a' € J and for all a G J and r € R, r ■ a G J. The smallest ideal containing a 
set S of elements, is the set (S) = I ^ ^ 0, ri,..., G R,si, ... ,Sk G 5}. The smallest ideal 

containing ideals Ji, J 2 is their sum R + J 2 = {si + S 2 | si G Ji, S 2 F •^21- 

Using ideals instead of conjunctions of polynomial equalities is justified because for every S C Q[z] and every 
V G it holds that s(v) = 0 for all s G {S) iff s(v) = 0 for all s G 5. 

An inductive invariant I of the yDMT transducer M relative to A is a family of ideals Ip C (Q[z],p G P, such 
that for all transitions p{p, f) = (pi,... ,Pm), 

Ip C {/G Q[z] IG (4i(^i) U • • • U Ip^(xm))} (6) 

holds where we used [v/z] to denote the substitution of the expressions for the variables zjk- Likewise for 
an ideal J C Q[z], J(xj) denotes the ideal: 


J(xi) = {s[xi/z] I s G J}. 


The constraint ® for the transition p{p, /) = (pi,... ,Pm) formalizes the following intuition. For every polyno¬ 
mial r', the polynomial r'[r^I'>/z] can be understood as the weakest precondition of r' w.r.t. the semantics of 
the input symbol /. It is a polynomial in the variables x where the variables in x* refer to the tth argument of /. 
The constraint ® therefore expresses that the weakest precondition of every polynomial in Ip can be generated 
from the polynomials provided by I for the states pi — after the variables z therein have been appropriately 
renamed with Xj. 

We verify for every inductive invariant I that each polynomial in the ideal Ip constitutes a valid property of 
all input trees in dom(p). This means: 

Theorem 9: Assume that I is an inductive invariant of the pDMT transducer M relative to A. Then for every 
state p of A and polynomial r' G Ip, r'(|f]) =0 holds for all t G dom(p). 

Proof: By structural induction on t, we prove that r'df]) = 0 holds. Assume that p{p,f) = (pi, • • • ,Pm) 
and t = f{ti,... Am) where (by induction hypothesis) for every i = 1,..., m and every r' G Ip., r'(|ti]) = 0 
holds. Since ftjgk = , [tm]), we have that 

r'(M)=r'[r(/)/z]([til,...,[Ul) 

Since I is inductive, r'[r^I'i/z] can be rewritten as a sum: 

m Ui 

r'[r^I^/z] r'ip^Sipfxi/z] 

i=l /ii=l 


for suitable polynomials where for i = 1,..., m, all Sip. G Ip^. Therefore for all pi, = 0, and thus 


/(M)=r'[r(/)/z]([til,...,[U) = 0. 


We conclude that every inductive invariant I with r' G Ip provides us with a certificate that r'(|f]) = 0 holds 
for all t G dom(p). In the next step, we convince ourselves that the reverse implication also holds, i.e., for all 
polynomials r' for which r'(|t]) = 0 holds for all i G dom(p), an inductive invariant I exists with r' G Ip. In 
order to prove this statement, we consider the family I of ideals Ip,p ^ P, where 

ip = {/ G Q[z] I Vf G dom(p). r'(|f]) = 0}. 

Thus, Ip is the set of all polynomials which represent a polynomial property of trees in dom(p). We next prove 
that I is indeed an inductive invariant. 

Theorem 10: I is an inductive invariant of the yDMT transducer M relative to A. 


Proof: For any set V C of vectors, let Z(V) denote the set of polynomials r' over z which 

vanish on V, i.e., with r'(v) = 0 for all v € V. We remark that for disjoint sets of variables xi,... ,Xm with 
Xj = {xjjfc I j = 1,..., n, A: = 0,..., and arbitrary sets Vi C 

T{ViX...X Vm) = U . . .UT{Vm){^ra)) (7) 

holds when considered as ideals of Q[x] = Q[xi U ... U x^]- This means that the set of polynomials which 
vanish on the Cartesian product Ci x ... x Vm is exactly given by the ideal in Q[x] which is generated by the 
polynomials in Q[xj] which vanish on the set Cj (z = 1 ,..., m). 

We remark that the ideal of Q[x] generated from T{Vi){xi) is exactly given by x x where 

T = Accordingly, equality ([7]) can be rewritten to: 

m 

I{ViX...x Vm) = X V- X T™-'). 

i=l 

Thus, equality ([7]) is a consequence of the following lemma, which we could not find in the literature. Although 
formulated for Q, the lemma holds (with the same proof) for any field. 

Lemma 11: Let Vi C T 2 C be subsets of vectors, with mi, m 2 positive integers. Then 

X{Vi X C 2 ) = AVi X +X(Q™^ X C 2 ). 

Proof: Since Vi x V 2 V Vi x it follows that Ii := X(l/i x Q™'^) C X{Vi x V 2 ). Likewise, I 2 := 

X V 2 ) C XiVi X V 2 )> and the inclusion “D” follows. 

The proof of the reverse inclusion uses Grdbner bases (for basic notions and concepts on Grdbner bases see 
the textbook by Becker and Weisspfenning iHTl l. Let x = {xi ... jX^i+ma} a suitable finite set of variables. 
Fix a monomial ordering on the polynomial ring Q[x]. With respect to this monomial ordering, let Gi, G 2 be 
Grdbner bases of Ii and I 2 , respectively. Clearly Gi U G 2 generates the sum Ii + I 2 . Since Ii is generated 
by polynomials in Q[xi,... ,XmJ, we have Gi C Q[xi,... ,XmJ, and also G 2 C Q[xmi+i, • •. jX^i+ms]- It 
follows by Buchberger’s criterion (see lldTl Section 5.5]) that Gi UG 2 is a Grdbner basis of Ii + / 2 - This implies 
that each polynomial / € Q[x] has a unique normal form g := nf(/), which (by definition) has no monomial 
that is divisible by the leading monomial of any polynomial in Gi U G 2 , and which satisfies f — g ^ Ii + l 2 - 
Moreover, if / € /i + /2 then <7 = 0. 

For the proof of the reverse inclusion, take / G Q[x] that does not lie in Ii + I 2 . So g := nf(/) 7 ^ 0. We 
have to show / ^ 'L{Vi x V 2 ), so we have to find v € Vi and w € V 2 such that /(v,w) 7 ^ 0. Considered as a 
polynomial in the variables xi,...,x^i, g has a nonzero term ■ ■ ■ x^^^ with c G Q[xmi+i, ■ ■ ■, Xmi+ma]- 
Since none of the monomials of c are divisible by any leading monomial of a polynomial from G 2 , the Grdbner 
basis property of G 2 implies c ^ I 2 , so there exists w G C 2 such that c(w) 7 ^ 0. 

Now consider the polynomial pw := 5 (xi,..., x^i, w) G Q[xi,...,x^J- This is nonzero since one of 
its coefficients, c(w), is nonzero. Moreover, no monomial from pw is divisible by any leading monomial of a 
polynomial from Gi, so g^ ^ h, implying that there is a vector v G Ci with Pw(v) 0. This means g{v, w) 7 ^ 0. 
But (/ — gr)(v, w) = 0 since / — <7 G /i + /2 C X{Vi x V 2 ), so we obtain f{-v,-w) 7 ^ 0, finishing the proof. ■ 
For each state p of A, let Vp = {|f] | t G dom(p)}. Then the ideal Ip is exactly given by Ip = T{Vp). Assume 
that r' G Ip and p{p, f) = {pi,... ,Pm) holds. Then for all tuples of trees with tj G dom(pj) 

(i = 1,..., m), r'(|/(ti,..., tm)]) = 0 holds. Therefore, 

r'[ri7)/z](vi,..., v^) = 0 

holds for all (vi,..., v^) G Vp^ x ... x Vp^. Therefore, 

r'[r(7)/z] G X{Vp, x ... x VpJ 

= (X(CpJ(xi) U ._. .UX{VpJ{xm)) by © 

= (/(xi) U . . . U/(Xm)). 


As a consequence, I satisfies the constraints ® and therefore is an inductive invariant of M relative to A. ■ 

The inductive invariant I is the largest invariant and, accordingly, the greatest fixpoint of the inclusions 
Since the set of polynomial ideals in Q[x] has unbounded decreasing chains, it is unclear whether I can he 
effectively computed. 

Let us first consider the case where the input alphabet of M (and thus also of A) is monadic. Then the variables 
from z can be reused for the copy xi of variables for the first (and only) argument of / E implying that 
every polynomial can be considered as a constant or a polynomial again over the variables z. Thus, the 
constraints in ® to be satisfied by an inductive invariant I, can be simplified to: 

Ip C {/E Q[z] I r'(r(^)) = 0} ifp(p,5) = () (8) 

Ip C {/ E Q[z] I r'[r^I^/z] E Ip^} if p{p, f) = pi (9) 

According to the second constraint, the demand for an equality r' = 0 to hold at p is transformed by the monadic 

input symbol / into the demand for the equality /z] A Q to hold at pi. The propagation of these demands 

generated for the equality H = 0 to hold at po can be expressed by the following system of constraints: 

Ipo 2 {H) 

Ipi 3 {r'[r^I^/z]\r' £ Ip} if p{pj) = Pi (10) 

Recall that Hilbert’s basis theorem implies that each ideal J C Q[z] can be represented by a finite set of 
polynomials si,... ,Su so that J = (si,..., Su) and likewise, that each increasing chain Jq C C ... of ideals 
is ultimately stable. Therefore, the system (fTOl) has a least solution, which is attained after finitely many fixpoint 
iterations. We claim: 

Lemma 12: Assume that I is the least solution of the system of constraints (ITOb . Then I is an inductive 
invariant iff for each transition p{p, b) = () of A, = 0 for all r' E Ip. In this case, it is the least inductive 

invariant I' with H £ I'p^- Otherwise, no inductive invariant with this property exists. 

Proof: We have that / is a solution of (ITOl) iff I satisfies fhe consfrainfs (|9l). Moreover, = 0 for all 

r' £ Ip' holds for all p{p, b) = () of A iff I safisfies the constraints dH). Therefore, / is an inductive invariant 
with H £ IpQ, iff these two assumptions are met. 

Now assume that I is the least solution of (fTOl) . If it passes all tests on the transitions p{p, b), it therefore must 
be the least inductive invariant I' with H £ Ip^- If it does not pass all tests, then there cannot be any inductive 
invariant I' with H £!},„■ This can be seen as follows. Assume for a contradiction that there is an inductive 
invariant I' with H £ Ip^- Since I' satisfies fhe consfrainfs in @, I' is also a solution of (ITOl) . Therefore, Ip C I'^ 
for all states p of A. Now since already I does not pass all tests on transitions p{p,b) = (), then I' cannot 
pass all these tests either. But then I' does not satisfy the constraints ® and therefore fails to be an inductive 
invariant — contradiction. ■ 

Since the least solution of system (fTOl) can effectively be computed and the tests required by Lemma can also 
be effectively performed, we obtain: 

Theorem 13: For a total unary yDMT transducer M with a monadic input alphabet, it is decidable whether or 
not two states are equivalent relative to a DTTA automaton A. 

Proof: Let H denote the polynomial Zj^o — Zj^o- Then = 0 for all t' £ dom(po) holds iff H £ Ip^. 

Now, H £ Ipg for some inductive invariant I iff H £ Ip^ for the least inductive invariant I' which, by Lemma [12] 
can be effectively computed. Since membership of a polynomial in an ideal can be effectively decided, the claim 
of the theorem follows. ■ 

In the following, we finally drop also the assumption that the pDMT transducer has a monadic input alphabet. 
What we keep is the assumption that the output alphabet is unary. For this case, we prove that equivalence is still 
decidable. An indicator for the extra complication due to non-monadic input symbols is that we are only able to 
provide two i'emi-algorithms, one which provides a proof of equivalence if equivalence holds, and another which 
provides an input tree for which the output differ — whenever non-equivalence holds. 


In case of non-monadic input symbols, it is no longer clear whether computing the greatest solution of constraint 
system ® can be replaced by computing the least solution over some suitably defined alternative constraint system 
over ideals. What we still know is that every ideal of Q[z] can be represented by a finite set of polynomials with 
coefficients, which can be chosen from Z. Since the validity of the inclusions ® can be effectively decided for 
any given candidate invariant I, the set of all inductive invariants of M relative to A is recursively enumerable. 
Accordingly, if zj^q — Zj'o = 0 holds for all vectors |f],f G dom(po)> an inductive invariant certifying this fact, 
will eventually be found in the enumeration. In this way, we obtain a semi-decision procedure for equivalence 
of the states jo,j'o of M relative to A. The fact, on the other hand, that zj^o ~ Zj'o = 0 does not hold for all 
[f],f € dom(po), is witnessed by a specific tree t G dom(po) for which — Mao 7 ^ 0- Since dom(po) is 
recursively enumerable as well, we obtain another semi-decision procedure, now for non-equivalence of states 
joij'o of M relative to A. Putting these two 5 em/-decision procedures together, we obtain: 

Theorem 14: Assume that M is a total yDMT transducer with unary output alphabet, A is a DTTA automaton. 
Then it is decidable whether or not two states jo > j'o of ^ equivalent relative to A. 

In the same way as in the last section. Theorem [14] provides us with a decision procedure for possibly partial 
unary yDMT transducers. We obtain our main technical result: 

Theorem 15: Equivalence for (possibly partial) unary yDMT transducers is decidable. 

VI. A More Practical Algorithm 

Clearly, checking all input trees is perhaps not the most systematic way of identifying a counter example to 
equivalence. Likewise, enumerating all mappings p i—)• Ip, in quest for a sufficiently strong inductive invariant 
seems completely impossible to be turned into a practical algorithm. Therefore, in this section we provide more 
realistic implementations of the two semi-algorithms to decide equivalence. 

To accomplish the task of identifying counter examples, we take a closer look at the greatest fixpoint iteration to 
determine the greatest inductive invariant p i—>• Ip. For p G P, let Ip^'^ = (1)q[z] = i-e-, the full polynomial 

ring, and for d > 0, define as 

( 11 ) 

= {r G Q[z]|r[r(fVz] G (-fi(xi) U ... U4(xfc))Q[z]} 

= I g = 1, ■ ■ ■, n, i = 0,..., 0 q[z] © 

(Ii(xi) U ... U 4(xfc))Q[z]) n Q[z] (12) 

For every p G P and d > 0, let domrf(p) denote the set of all input trees t G dom(p) of depth at most d where 
we consider leaves to have depth 1. Then we have: 

Lemma 16: 1) For every d > 0, = {r G Q[z] | Vf € domfc(p). r(|t]) = 0}; 

2) For every p G P, Ip = | d > 0}. 

Proof: Statement 1 follows by induction on d along the same lines as the proofs of Theorems |9] and [TO] 
Statement 2 follows from statement 1 as the intersection to the right consists of all polynomials r G Q[z] so that 
r(|t]) = 0 for all t G dom(p) — which precisely is the definition of Ip. ■ 

From statement 1 of Lemma [161 we conclude that there is a counter example to equivalence of q, q' of depth 
d iff Hq^qi 0 Tpg'^ for the initial state po of tho DTTA automaton B. Thus, the semi-algorithm for falsifying 
equivalence, can be formulated as: 

for (d > 0) { 

determine p i-> 1 ^^^; 

if {Hq q> 0 ipP) return “not equivalent”; 

} 


We turn to the efficient enumeration of candidate invariants. Let us again fix some bound d. This fime, the 
bound d is used as the degree bound to the polynomials to be considered during the fixpoint iteration. Let Qd[z] 
denote the set of all polynomials in Q[z] of total degree at most d. Here, the total degree of a polynomial r is 
the maximal sum of exponents of a monomial occurring in r. For an ideal I C Q[z], let us denote Xd as the 
intersection X n Q[z]. This set of polynomials can be considered as a vector space of finite dimension and can 
also be considered as a pseudo ideal in the sense of Colon Il5^ . 

The intersection Xd can be effectively computed as follows. Given a Grdbner basis G for X relative to some 
graded lexicographical ordering of monomials, it suffices to extract the subset Gd of polynomials in G of total 
degrees bounded by d. Then a set of generators of Xd considered as a vector space is given by the set of 
polynomials g ■ m with g £ Gd for monomials m with def(m) + deg(p) < d. Moreover, the ideal (Gd)Q[z] is the 
smallest ideal X' C X so that X’ n Qd[z] = X n Q[z]. 

Let ad denote the function that maps each ideal X to the corresponding ideal (Gd)Q[z]- Let denote the 
set of all ideals generated from Grdbner bases of total degree at most d. In this complete lattice, all decreasing 
sequences are ultimately stable. The idea is to compute increasingly precise abstractions of the mapping p Xp 
by means of the complete lattices B^. For d > 1, we put up the constraint system over B^, consisting of all 
constraints 

Ip C adilffiIp.,---Jp.) (13) 

for every transition p{p, f) = pi .. .pk- Since all right-hand sides are monotonic, the greatest solution of this 
system exists. Since B^ has finite descending chains only, the greatest solution is attained after finitely many 
fixpoint iterations. Let p i—>• Xp^d denote the resulting greatest solution. We have: 

Lemma 17: 1) For all d > 1, p i—)■ Xp ^ is an inductive invariant; 

2) For all d > 1, Xp^d ^ l-p^+i holds for every p £ P-, 

3) There exists some d > 1 such that Xp^d = Xp for all p £ P. 

Proof: The first two statements are obvious. In order to prove the third statement, consider the greatest 
inductive invariant p i—>• Xp. Thus in particular, Xp ^ C Xp for all p and d. For every state p, let Gp denote the 
Grdbner base of Xp. Let d denote the maximal global degree of any polynomial in the set lJ{Gp | p £ P}. Then 
p Xp is a solution of the constraint system ([T3l ). Accordingly, XpC Xp g. Hence Xp = Xp j, and statement 3 
follows. ■ 

In light of the argument for proving statement 3 of Lemma [TtJ we observe that for any d, p Xp^d represents 
the largest inductive invariant which can be represented by a Grdbner basis with maximal total degree d. Given 
Lemma [TTJ the semi-algorithm for verifying equivalence therefore looks as follows: 

for (d > 1) { 

determinep Xpd] 

if [Hq,,' £ Xpg d) return “equivalent”; 

} 

This algorithm now provides a systematic way to generate inductive invariants of increasing precision thus 
complementing the systematic enumeration method of counter examples of increasing depths. 

VH. From yDT transducers to unary yDMTT transducers 

In the following, we show that every total pDT transducer can be simulated by a total pDMT transducer with 
a unary output alphabet and polynomial size. This is the content of the next lemma. Assume that the output 
alphabet is given by A = [s]. By considering the elements of A as non-zero digits of the number system with 
base s -|- 1, each element w £ A* can be uniquely represented by a natural number. If tu = mi... Wk, wj £ [s], 
this number is given by [m]s+i = + - particular, [e]s+i = 0, i.e., the empty string is represented 

by 0. We have: 


Lemma 18: Assume that M is a total yDT transducer with set [n] of states and output alphabet [s]. Then a 
unary yDMT transducer N with the same set of states and a single parameter, can he constructed in polynomial 
time so that for every state q G [n] and input tree t, MivW(e) = [MM(f)]s+i- 
Moreover, if M is linear, then N is non-self-nested. 

Proof: Let M = (Q, S, A, go,-R) where Q = [n]. We define N = {Q,T.,{d},qo,R') as follows. For every 
rule q{f{xi,... ,Xk)) — )■ T in i? we let the rule q{f{xi,.. . ,Xk),yi) — > ll[T] he in R\ The parameter yi is 
meant to contain the right context (in unary). The mapping if [T] is defined as follows: 

U[aT] = a + (s + 1) • U[T] 

U[q'ixi)T] = qfxi,U[T]) 

U[e] = yi. 

For the yDMT transducer N we prove the following invariant: 

MAr(i)(M5-ri) = llolMit) U)]^+1. 

From that, the statement follows hy choosing q = qo and w = e.\n order to prove the invariant, we proceed hy 
induction on the structure of t. So assume that t = /(ti,..., t^), m > 0, where 5{q, f) = T. By induction, we 
may assume that the invariant already holds for ti,... ,tfc and all output words w. Then we prove that for all 
suhsequences T' of T and all words rj; G [s]* the following invariant holds: 

[if[T']i^ t (H,+i) = [[rvtu^]s+i 

where t = The invariant for t follows because {qj^ (t) (yi) = |if[T]]Ar t (yi) and |q]M(i) = 

|T]a^ t. If T' = e, we have that 

[leju t mjs+i = [w]s+i = [if[e]|Ar t (Hs-ri) 

and the invariant holds. 

If T' = aT" for some output symbol a G [s], we have that 

[|aT"]M t w]s+i = [alT"jM t w]s+i 

= a -|- (s -|- 1 ) • [[T'^Jat' t u^]s+i 

= a -|- (s -|- 1) • [if t ([wjs+i) by induction for T” 

= Iif[ar"]]7v t (H^+i) 

and the invariant holds, by induction, for T". 

Therefore, it remains to consider the case where T' = q'{xi)T''. Then: 

[lq'{xi)T"}M t w]s+i = [lq'}M{ti)lT"jM t w]s+i 

= t w]s+i) by induction for and w' = IT''\m t w 

= lQ'jN(ti)illd[T”]jN t (Hs+i)) by induction for T" 

= WWixi)T'']jN t (H^+i) 

and the assertion follows. — Obviously, if M is linear then N is non-self-nested. ■ 

VIII. Output in the Free Group 

In some applications, the output produced by a transducer cannot be considered as a sequence of uninterpreted 
letters, but consists in a sequence of operators. A first step in direction of dealing with such interpreted output is 
to consider the output to be an element of nfree group. In that, we go beyond the free monoid as output domain 
and assume that for each output letter a, there is an inverse letter a~ together with the cancellation rules 

aa~ = a~ a = e . 


And we ask whether equivalence of transducers remains decidable if output strings are considered equivalent up 
to applications of these rewrite rules. 



Let us first consider output generated in the free group Fi with a single generator. The free group Fi is 
isomorphic to the integral ring Z. This means that our construction for yDMT transducers with unary output 
alphabet can he readily applied also to yDMT transducers with output in Fi. We obtain: 

Theorem 19: Assume that M is a total yDMT transducer with output in the free group Fi, and A is a DTTA 
automaton. Then it is decidable whether or not two states jo,jo of M are equivalent relative to A. If M is 
non-self-nested, in-equivalence can even be decided in randomized polynomial time. ■ 

We now consider the case when we have two distinct output symbols a, h and together with their respective 
inverses a~ and b~, i.e., we consider outputs in the free group F 2 with two generators. We do not know how the 
result for Fi and yDMT transducers can be generalized to this more general situation. What we can do, however, 
is to consider yDT transducers where output symbols are interpreted as I x I matrices with entries in Q. Let 
Mi(Q) denote the monoid of all such matrices where the monoid operation is matrix multiplication. Given an 
interpretation a : A ^ A4i(Q) of output symbols, every output string w then represents a matrix a{w) G Mi(Q). 
Accordingly, the semantics |f] of an input tree t turns into a vector of matrices from Adi(Q), i.e., |f] € 
Moreover, every input symbol / of arity A; > 0 corresponds to a mapping: 

I/l : X ... X Qn-{ixi) 

(n the number of states of the transducer) which transforms the vectors corresponding to the semantics of the 
argument trees into the vector corresponding to the result vector of matrices for the whole input tree. By induction 
on the structure of right-hand sides, we find fhaf the entry (|/](xi,... ^ polynomial in the 

variables Xjg/;^/„/, f € {1,..., k}, g' G {1,... , n}, A', /r' G {1,..., /}. In the particular case that the yDT transducer 

{ f) 

is linear, each is multi-affine in the vectors xi,... ,Xfc. 

Example 4: Consider the two matrices 



■ 3 

1 ■ 


■ 3 

2 ■ 

mi = 

0 

1 

m 2 = 

0 

1 


Let A = { 01 , 02 }. The monoid homomorphism a : A 

a(ojj ... aj^) = 


3" w 
0 1 


Ad 2 (Q) mapping ai to rrii is injective and given by 

s-l 

where w = ^ 3^ • jx 
A=0 


In this way, the free monoid of strings over the alphabet A can be represented by the sub-monoid of Ad 2 
generated by mi, m 2 . Consider, e.g., a transition 

g(/(xi, X 2 )) —^ oi q'ix2) 02 g"(xi) 

of some deterministic yDT transducer with output in Ad 2 (Q) according to the given a. Then according to our 
definition. 


(|/|(xi,X2))g = mi • X2g' ■ m2 • Xlq-/ 

X2g'll X2g'12 


-1 

CO 

1 ■ 


1- 

0 

1 



3 2 
0 1 


Xlg"ll Xiq'/12 
Xlg"21 ^lq"22 


Let denote the set of polynomials ~ ^ (I; • • • > ^}- Then the states Joj Jo equivalent 

relative to the DTTA automaton A iff r(|f]) = 0 for all r G and t G dom(po)- In general, this can be 

decided by the algorithm which we applied in the proof of Theorem [141 but adapted to the new construction of 

( f) 

the polynomials and where the single linear target equality LAjojo replaced with the conjunction of the 
finitely many target equalities from TLjoj’g- In the case of linear yDT transducers, the algorithm from Section HVl 
can be adapted accordingly. Therefore, we obtain: 
















Theorem 20: Assume that M is a total yDT transducer with output in the monoid and A is a DTTA 

automaton. Then it is decidable whether or not two states jo,jQ of M are equivalent relative to A. If the yDT 
transducer is linear and the ranks of the input symbols is bounded, in-equivalence is decidable in randomized 
polynomial time. ■ 

We remark that extending the matrix monoid considered in Example |4] with inverses, would not provide us with 
a free group. Instead, however, we can consider the subgroup of 2 x 2 matrices generated from the elements: 


This group is also known as Sanov group S (see, e.g.. Example 4.5.1 of |[54ll ). Since both matrices have determinant 
1, all elements in S have integer coefficients only. In particular, the inverses of the two generators are given by: 


a = 


1 0 
2 1 


a 


1 0 

-2 1 



-2 

1 


The Sanov group is particularly useful for us, since S with subgroup generators a and b is isomorphic to the 
free group freely generated from a, b. Thus, our results on yDT transducers with output in matrix monoids in 
Theorem [20l gives us: 

Theorem 21: Assume that M is a total yDT transducer with output in the free group IF 2 , and A is a DTTA 
automaton. Then it is decidable whether or not two states jo,jo of M are equivalent relative to A. If the yDT 
transducers are linear and the ranks of their input symbols are bounded, in-equivalence is decidable in randomized 
polynomial time. ■ 

Since the free group with two generators has a free group with I > 2 generators as a subgroup. Theorem |2l] 
implies that equivalence of total yDT transducers relative to some DTTA automaton is also decidable when the 
outputs are in a free group with I > 2 generators. 

Eemma [18] allows to apply our decision procedures for unary yDMT transducers to decide equivalence for 
yDT transducers with arbitrary output alphabets. Via Eemma [TH equivalence for linear (possibly partial) yDT 
transducers is reduced to equivalence of non-self-nested unary yDMT transducers, while equivalence for arbitrary 
(possibly partial and non-linear) yDT transducers is reduced to equivalence of general unary yDMT transducers. 
In summary, we obtain: 

Theorem 22: Equivalence of arbitrary yDT transducers is decidable. If the yDT transducers are linear and the 
ranks of their input symbols are bounded, in-equivalence is decidable in randomized polynomial time. 

The second part of Theorem [22] follows from Theorem |7] and Eemma [T] One particular subcase of Theorem |22| 
is when the input alphabet is monadic. This case is known to be equivalent to the sequence equivalence problem 
of hdtOl systems ll^ . lIMI . By Eemma dH this problem can be reduced to the equivalence problem for unary 
yDMT transducers with monadic output alphabet, for which a direct algorithm based on fixpoint iteration over 
polynomial ideals has been presented in the last section. The equivalence problem for yDT transducers with 
non-monadic input alphabets, as shown to be decidable in Theorem |22j seems to be significantly more difficult. 


IX. Application to other Types of Transducers 

Tree transducers can be equipped with regular look-ahead. Eor top-down transducers this increases the expres¬ 
sive power. A top-down or macro tree-to-string transducer with regular look-ahead (yDT^ and yDMT^ transducer) 
consists of an ordinary such transducer together with a complete deterministic bottom-up tree automaton B. Eor 
a yDT^ transducer, a rule is of the form 

Q{f{xi : Pi , . . . , Xjyi ■ Pm)) T 

where T is as for ordinary yDT transducers, and the pi are states of B. The rule is applicable to an input tree 
ft if B arrives in state pi on input tree tj for every z G {1,..., m}. Our result extends to look-ahead, using the 










technique as in ll25l : one changes the input ranked alphabet to contain state information of B, and changes the 
transducer to check the correctness of the information. 

By a result of ll46l the class of translations realized hy yDT^ transducers is equal to the class realized hy 
macro transducers which use each parameter in a rule precisely once (and have look-ahead). In fact, hy the results 
of we can state the result in terms of yDMT^ transducers that are finite-copying in the parameters (yDMT^p 
transducers). A yDMT^ transducer is finite-copying in the parameters if there exists a k such that for every input 
tree s, state q (of rank I 1), and j G [(], the number of occurrences of pj in |g](s) is < k. 

Corollary 23: Equivalence of yDT^ transducers and 7/DMT^p transducers is decidable. 

A variation of transducers that has been considered in the context of XML, are transducers of unranked trees. In 
an unranked tree, the number of children of a node is not determined by the label of that node, but is independent. 

For instance the term a{a{a,b),a,a,a) represents an unranked tree. XMLdocument trees are naturally modeled 
by unranked trees. 

There are several models of top-down tree transducers for unranked trees. In |[2^ macro forest transducers, 
and their parameterless version, the forest transducers, are defined. 

The rules of a forest transducer are very similar to the rules of a yDT transducer and are of the form 
q{a{xi,X 2 ) —> T where T is a string as in a yDT transducer, with the only difference that T may contain 
special symbols “(” and “)” of opening and closing brackets, and if so, then T must be well-balanced with 
respect to these brackets. If such a rule is applied to an unranked o-labeled node u, then xi represents the 
first subtree of u, and X 2 represents the next sibling of u. The special bracket symbols in right-hand sides have 
the obvious interpretation of generating a tree structure. Obviously, when checking equivalence of two forest 
transducers, we may consider their output as strings. Thus, the equivalence problem for deterministic forest 
transducers is just a direct instance of the equivalence problem for yDT transducers. 

Corollary 24: Equivalence is decidable for deterministic forest transducers. 

Another, much earlier model of unranked top-down tree transducers is the unranked top-down tree trans¬ 
ducer Q- Instead of state calls q{xi) as in an ordinary ranked top-down tree transducer, they use calls of the 
form L where L is a regular language over the set of states Q of the transducers, plus the special symbol 0. If 
the current input node has A:-many children, then a word of length k from L is chosen in order to determine 
which states translate the children nodes (where 0 means that no state translates the corresponding node). Such 
a transducer is deterministic, if for every k and every L in the right-hand side of a rule, L contains at most one 
string of length k. As an example, consider the unranked top-down tree transducer with the following rules: 

yo(a(---)) ^ a{L) 
q{a{---)) a{L)L 

where L is the regular language q* consting of all strings of the form qq - ■ ■ q. For the input tree s = a{a{a{a))), 
this transducer first applies the first rule to obtain a{q{si)) where si = a{a{a)). We now apply the second rule to 
obtain a{a{q{s 2 ))q{s 2 )) where S 2 = a{a). Two more applications of the second rule give a{a{a{q{a))q{a))a{q{a))q{a)), 
and finally we obtain the output tree a{a{a{a)a)a{a)a). 

As mentioned by Perst and Seidl, any unranked top-down tree transducer can be realized by a forest transducer. 
However, they only mention this for nondeterministic transducers. Thus, given a deterministic unranked top-down 
tree transducer M, we can construct an equivalent nondeterministic forest transducer N. It follows from the 
explanation above that we may consider as a nondeterministic top-down tree-to-string transducer. 

By an old result Il55l . for any functional DT transducer, an equivalent DT^ transducer can be constructed. 
Since “yield” which turns a tree into its string of leaf symbols is a function, it directly follows that also for any 
functional yDT transducer, one can construct an equivalent yDT*^ transducer. Thus, we can construct for N an 
equivalent yDT^ transducer, for which equivalence is decidable by Corollary 

Corollary 25: Equivalence is decidable for deterministic unranked top-down tree transducers. 

Moreover by the constructions from section IVTTTI the results stated in in corollaries I23I24I and |25] also hold when 
output is not considered in the free monoid, but in the free group. 






X. Conclusion 


We present algorithms for deciding equivalence of deterministic top-down tree-to-string transducers (yDT trans¬ 
ducers). For yDT transducers with general output alphabets, we provide a construction which encodes outputs 
over arbitrary output alphabets into outputs over a unary alphabet. This construction requires to introduce an 
extra parameter. For arbitrary yDT transducers, it results in yDMT transducers with unary output alphabet, which 
are non-self-nested whenever the original yDT transducer is linear. For the case of non-self-nested unary yDMT 
transducers, we show that multi-affine mappings and affine spaces are sufficient to decide equivalence, whereas 
in the general case, we had to resort to polynomial ideals. 

The key concept which helped us to arrive at a decision procedure in the general case, are inductive invariants 
certifying assertions. While such invariants can be automatically inferred for monadic input alphabets, we were less 
explicit for non-monadic input alphabets. Here, we only prove that an inductive invariant certifying a polynomial 
equality exists, whenever the equality holds. Since enumerating all inductive invariants is rather impractical, we 
presented a more explicit method which allows to systematically construct the best inductive invariant up to a given 
maximal degree. Together with an explicit enumeration of potential counter-examples, decidability of equivalence 
of arbitrary yDMT transducers with unary output alphabet follows. This result means that Abelian equivalence, 
i.e., equivalence up to the ordering of symbols in the output, is decidable for general yDMT transducers. The 
same holds true for growth equivalence where only the lengths of output strings matter. 

By means of our simulation of arbitrary output alphabets with unary ones, we obtained a randomized polynomial 
algorithm for deciding in-equivalence of linear t/DT transducers. The strongest result, however, is decidability of 
equivalence of general yDT transducers with arbitrary output alphabets. Both algorithms are then extended to the 
case when output is no longer considered to be in a free monoid, but in a free group. 

Still, our decision procedures leave room for generalizations. The equivalence problem for yDMT transducers 
with unary output alphabet being solved, the equivalence problems (as stated in ||2^ ) for r/DMT transducers with 
arbitrary output alphabets, and even for DMT transducers with tree output, remain open. 
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